Comparative Analysis of Parametric and Non-Parametric Data-Driven Models to Predict Road Crash Severity among Elderly Drivers Using Synthetic Resampling Techniques

نویسندگان

چکیده

As the global elderly population continues to rise, risk of severe crashes among drivers has become a pressing concern. This study presents comprehensive examination crash severity this demographic, employing machine learning models and data gathered from Virginia, United States America, between 2014 2021. The analysis integrates parametric models, namely logistic regression linear discriminant (LDA), as well non-parametric like random forest (RF) extreme gradient boosting (XGBoost). Central is application resampling techniques, specifically, over-sampling examples (ROSE) synthetic minority technique (SMOTE), address dataset’s inherent imbalance enhance models’ predictive performance. Our findings reveal that inclusion these techniques significantly improves power notably increasing true positive rate for prediction 6% 60% geometric mean 25% 69% in regression. Likewise, SMOTE resulted notable improvement performance, leading increase 8% 36% XGBoost. Moreover, established superiority over counterparts when balanced are utilized. Beyond modeling, delves into effects various contributing factors on severity, enhancing understanding how influence road safety. Ultimately, underscore immense potential analyzing complex data, pinpointing heighten informing targeted interventions mitigate risks driving.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stochastic Non-Parametric Frontier Analysis

In this paper we develop an approach that synthesizes the best features of the two main methods in the estimation of production efficiency. Specically, our approach first allows for statistical noise, similar to Stochastic frontier analysis, and second, it allows modeling multiple-inputs-multiple-outputs technologies without imposing parametric assumptions on production relationship, similar to...

متن کامل

Parametric and non-parametric statistical analysis of DT-MRI data.

In this work parametric and non-parametric statistical methods are proposed to analyze Diffusion Tensor Magnetic Resonance Imaging (DT-MRI) data. A Multivariate Normal Distribution is proposed as a parametric statistical model of diffusion tensor data when magnitude MR images contain no artifacts other than Johnson noise. We test this model using Monte Carlo (MC) simulations of DT-MRI experimen...

متن کامل

Evaluation of Grain Yield Stability of Durum Wheat Genotypes Using Parametric and Non-Parametric Methods

Durum wheat (Triticum turgidum L.), like most other crops, is affected by various stresses. Therefore, cultivars that, in addition to the ability to produce higher yields, can maintain their yield potential in different years and locations are considered superior cultivars. In order to obtain high-yielding and stable genotypes of durum wheat, 16 lines with two control cultivars Dehdasht and Sey...

متن کامل

Trend analysis of discharge changes in the watershed of Nahrin Tabas Dam watershed using parametric and non-parametric methods

The trend of river discharge changes in arid and semi-arid regions is mostly decreasing and this trend can be influenced by climatic factors and rising temperatures in recent decades in these areas. This reduction in discharge could have a significant impact on the water storage of the Nahrin Dam and agriculture, industry and drinking water in Tabas City, which should be considered by experts. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Sustainability

سال: 2023

ISSN: ['2071-1050']

DOI: https://doi.org/10.3390/su15139878